201 Start

205 Modify string of characters with heuristics

210 Compare the string of characters to known strings of letters
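
Steps 205 and 210 can be sketched as follows. This is a minimal illustration, not the method the flowchart specifies: the figure does not name the heuristics, so the lowercasing, punctuation stripping, whitespace collapsing, and leading-"the" removal below are assumptions, as are the function names `apply_heuristics` and `find_known_match`.

```python
import re


def apply_heuristics(text: str) -> str:
    """Normalize a string with a few illustrative (assumed) heuristics."""
    text = text.lower()
    text = re.sub(r"[^a-z0-9 ]+", " ", text)  # replace punctuation with spaces
    text = re.sub(r"\s+", " ", text).strip()  # collapse runs of whitespace
    if text.startswith("the "):               # assumed heuristic: drop a leading article
        text = text[4:]
    return text


def find_known_match(text, known_strings):
    """Return the first known string whose normalized form equals the
    normalized input, or None when there is no exact match."""
    normalized = apply_heuristics(text)
    for candidate in known_strings:
        if apply_heuristics(candidate) == normalized:
            return candidate
    return None
```

For example, `find_known_match("the beatles.", ["Beatles", "Rolling Stones"])` returns `"Beatles"`, while a string with no normalized match returns `None` and would presumably fall through to the sub-string steps.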

225 Form sub-strings from the string of characters

230 Apply an information retrieval technique to the sub-strings

Perform Database Update

Exit Text Equivalencing

String of letters is determined to be equivalent to another string of characters

Update database of known strings of characters to indicate the equivalency

405 Each character is a sub-string (n=1)

410 Segment according to currently known n-grams, favoring longer n-grams

415 Look for frequently appearing (n+1)-grams

Found new frequent sub-strings?

420 n = n+1
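
The loop in steps 405 through 420 might look like the sketch below. It rests on several assumptions not stated in the figure: "frequent" is taken to mean a raw count of at least `min_count`, candidate (n+1)-grams are counted as plain character windows rather than built from adjacent segments, and the loop is capped at `max_n`. The names `segment` and `learn_ngrams` are hypothetical.

```python
from collections import Counter


def segment(text, known_ngrams):
    """Greedy left-to-right segmentation that favors the longest known
    n-gram at each position; single characters are the fallback (n=1)."""
    max_len = max((len(g) for g in known_ngrams), default=1)
    pieces, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in known_ngrams:
                pieces.append(piece)
                i += size
                break
    return pieces


def learn_ngrams(corpus, min_count=2, max_n=5):
    """Grow the set of known n-grams one length at a time, stopping when
    no new frequent (n+1)-grams are found (assumed stopping rule)."""
    known = set()
    n = 1
    while n < max_n:
        counts = Counter()
        for text in corpus:
            for i in range(len(text) - n):  # every window of length n+1
                counts[text[i:i + n + 1]] += 1
        new = {g for g, c in counts.items() if c >= min_count} - known
        if not new:   # decision box: no new frequent sub-strings
            break
        known |= new
        n += 1        # step 420
    return known
```

With the corpus `["the cat", "the dog"]`, the shared prefix is learned as an n-gram, so `segment("the cat", learn_ngrams(["the cat", "the dog"]))` yields `["the ", "c", "a", "t"]`.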

501 Start with query string

502 Separate query into sub-strings

503 Compute weight vector for query

504 Compute corpus statistics for all sub-strings

505 Separate all known equivalent texts into sub-strings

506 Compute weight vectors for all known equivalent strings

507 Compute dot product of query vector and all known equivalent strings

508 Place equivalent strings into score buckets per thresholds
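
Steps 501 through 508 resemble a vector-space retrieval pass, and the sketch below shows one way they could fit together. The figure does not name a weighting scheme, so the smoothed TF-IDF weights, the unit-length normalization, the character trigrams standing in for the sub-strings, the threshold values, and the bucket names are all assumptions made for illustration.

```python
import math
from collections import Counter


def substrings(text, n=3):
    """Overlapping character n-grams (an assumed stand-in for the segmenter)."""
    text = text.lower()
    return [text[i:i + n] for i in range(max(len(text) - n + 1, 1))]


def weight_vector(text, doc_freq, num_docs):
    """Unit-length vector of smoothed TF-IDF weights (assumed weighting)."""
    counts = Counter(substrings(text))
    vec = {
        s: tf * (math.log((1 + num_docs) / (1 + doc_freq.get(s, 0))) + 1.0)
        for s, tf in counts.items()
    }
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {s: w / norm for s, w in vec.items()} if norm else vec


def dot(a, b):
    """Sparse dot product of two weight vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(s, 0.0) for s, w in a.items())


def score_buckets(query, known_strings, thresholds=(0.8, 0.5)):
    """Score each known string against the query and bucket by threshold."""
    # Corpus statistics: number of known strings containing each sub-string.
    doc_freq = Counter(s for text in known_strings for s in set(substrings(text)))
    num_docs = len(known_strings)
    query_vec = weight_vector(query, doc_freq, num_docs)
    high, low = thresholds  # hypothetical cut-offs
    buckets = {"likely": [], "possible": [], "unlikely": []}
    for text in known_strings:
        score = dot(query_vec, weight_vector(text, doc_freq, num_docs))
        name = "likely" if score >= high else "possible" if score >= low else "unlikely"
        buckets[name].append((text, score))
    return buckets
```

Because both vectors are normalized, the dot product is a cosine similarity between 0 and 1, which makes fixed thresholds easier to interpret. A query that shares no trigrams with a known string scores 0 and lands in the lowest bucket.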